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portion of C4bp used was placed C-terminal to the scFv sequence, optionally spaced by a 
MYC tag. 

Many proteins require the assistance of molecular chaperones in order to be fold in vivo 
or to be refolded vitro in high yields. Molecular chaperones are proteins, which are 
often large and require an energy source such as ATP to function. A key molecular 
chaperone m Escherichia coli is GroEL, which consists of 14 subunits each of some 57 5 
kD molecular mass arranged in two seven membered rings. There is a large cavity in the 
GroEL nng system, and it is widely believed that the cavity is required for successful 
protem folding activity. For optimal activity, a co-chaperone, GroES, is required which 
consists of a seven membered ring of 10 kD subunits. The activity of the GroEL/GroES 
complex requires energy source ATP. GroEL and GroES are widespread throughout all 
organisms, and often referred to as chaperonin (cpn) molecules, cpn 60 and cpn 10 
respectively. 

GroEL is an allosteric protein. Allosteric proteins are a special class of oligomeric 
protems, which alternate between two or more different three-dimensional structures on 
the bmdmg of ligands and substrates. Allosteric proteins are often involved in control 
processes in biology or where mechanical and physico-chemical energies are 
interconverted. The role of ATP is to trigger this allosteric change, causing GroEL to 
convert from a state that binds denatured proteins tightly to one that binds denatured 
proteins weakly. The co-chaperone. GroES, aids in this process by favouring the weak- 
bmdmg state. It may also act as a cap. sealing off the cavity of GroEL. Further its 
binding to GroEL is likely directly to compete with the binding of denatured substrates 
The net result is that the binding of GroES and ATP to GroEL which has a substrate 
bound m Its denatured form is to release the denatured substrate either into the cavity or 
into solution where it can refold. 
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Mimchaperones have been described in detail elsewhere (see International patent 
apphcation WO99/05163, the disclosure of which in incorporated herein by reference) 
Mmichaperone polypeptides possess chaperoning activity when in monomeric fonn and 
do not require energy in the form of ATP. Defined fragments of the apical domain of 
GroEL of approximately 143-186 amino acid residues in length have molecular 
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chaperone activity towards proteins either in solution under monomeric conditions or 
when monodisperse and attached to a support. 

Summary of the Invention 

The activity of minichaperones, although sufficient for many purposes, is inferior to that 
of intact GroEL. It is postulated that this could be due to the inability of GroES to 
oligomerise. There is thus a widespread requirement for a system which would allow the 
oligomerisation of polypeptides to form functional protein oligomers which have 
activities which surpass those of recombinant monomeric polypeptides. 

According to the present invention, there is provided a polypeptide monomer capable of 
oligomerisation. said monomer comprising an heterologous amino acid sequence inserted 
into the sequence of a subunit of an oligomerisable protein scaffold. 

It has been oberserved that the oligomensation of heterologous polypeptides allows their 
spatial juxtaposition, which may potentiate their activity. Where the activity is or 
involves binding, oligomensation significantly increases the avidity of binding over that 
which is observed with monomers. Moreover, if the oligomer is heterogeneous, 
oligomeric constructs according to the invention permit the juxtaposition of a plurality of 
biological activities which can be brought to bear on a single molecule 
contemporaneously. 

A protein scaffold is a protein, or part thereof, whose function is to detennine the 
structure of the protein itself, or of a group of associated proteins or other molecules. 
Scaffolds therefore have a defined three-dimensional structure when assembled, and have 
the capacity to support molecules or polypeptide domains m or on tiie said structure. 
Advatageously, a scaffold has the ability to assume a variety of viable geometries, in 
relation to the three-dimensional structure of the scaffold and/or the insertion site of the 
30 heterologous polypeptides. 

Preferably, the scaffold according to the invention is a chaperonin cpnlO/HsplO scaffold. 
rpnlO is a widespread component of the cpn60/cpnl0 chaperonin system. Examples of 
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cpnlO include bacterial GroES and bacteriophage T4 Gp31. Further members of the 
cpnlO family will be known to those skilled in the art. 

The invention moreover comprises the use of derivatives of naturally-occuring scaffolds. 
Derivatives of scaffolds (including scaffolds of the cpnlO and 60 families) comprise 
mutants thereof, which may contain amino acid deletions, additions or substitutions 
(especially replacement of Cys residues in Gp31). hybrids formed by fusion of different 
members of the CpnlO or 60 families and/or circular pemiutated protein scaffolds, subject 
to the maintenance of the "oligomerisation" property described herein. 

Protein scaffold subunits assemble to fomi a protein scaffold. In the context of the 
present invention, the scaffold may have any shape and may comprise any number of 
subunits. Preferably, the scaffold comprises between 2 and 20 subunits, advantageously 
between 5 and 15 subunits, and ideally about 10 subunits. The scaffold of cpnlO family 
members comprises seven subunits, in the shape of a seven-membered ring or amiulus. 
Advantageously, therefore, the scaffold is a seven-membered ring. 



Advantageously, the heterologous amino acid sequence is inserted into the sequence of 
the oligomerisable protein scaffold subunit such that both the N and C temiini of the 
20 polypeptide monomer are fomied by the sequence of the oligomerisable protein scaffold 
subunit. Thus, the heterologous polypeptide is included with the sequence of the scaffold 
subunit, for example by replacing one or more amino acids thereof. 
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It is known that cpnlO subunits possess a "mobile loop" within their structure. The 
mobile loop is positioned between amino acids 15 and 34, preferably between amino 
acids 16 to 33, of the sequence of E. coli GroES, and equivalent positions on other 
members of the cpnlO family. The mobile loop of T4 Gp31 is located between reidues 22 
to 45, advantageously 23 to 44. Advantageously, the heterologous polypeptide is inserted 
by replacing all or part of the mobile loop of a cpnlO family polypeptide. 

Where the protein scaffold subunit is a cpnlO family polypeptide, the heterologous 
sequence may moreover be incorporated at the N or C terminus thereof, or in positions 
which are equivalent to the roof p hairpin of cpnlO family peptides. This position is 
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located between positions 54 and 67, advantageously 55 to 66, and preferably 59 and 61 
of bacteriophage T4 Gp31, or beUveen positions 43 to 63, preferably 44 to 62, 
advanrageously 56 to 57 of coli GroES. 

Advantageously, the polypeptide may be inseretd at an N or C terminus of a scaffold 
subunit in association with circular permutation of the subunit itself. Circular 
pennutation is described in Graf and Schachman, PNAS(USA) 1996, 93:11591. 
Essentially, the polypeptide is circularised by fusion of the exisiting N and C termini, and 
cleavage of the polypeptide chain elsewhere to creat novel N and C termini. In a 
preferred embodiment of the invention, the heterologous polypeptide may be included at 
the N and/or C terminus formed after circular permutation. The site of formation of the 
novel termini may be selected according to the features desired, and may include the 
mobile loop and/or the roof P hairpin. 

Advantageously, heterologous sequences, which may be the same or different, may be 
inserted at more than one of the positions above-identified in the protein scaffold subunit. 
Thus, each subunit may comprise two or more heterologous polypeptides, which are 
displayed on the scaffold when this is assembled. 

Heterologous polypeptides may be displayed on a scaffold subunit in free or constrained 
form, depending on the degree of freedom provided by the site of insertion into the 
scaffold sequence. For example, varying the length of the sequences flanking the mobile 
loop in the scaffold will modulate the degree of constraint of any heterologous 
polypeptide inserted therein. 

In a second aspect, the invention relates to a polypeptide oligomer comprising two or 
more monomers according to the first aspect of the invention. The oligomer may be 
configured as a heterooligomer, comprising two or more different amino acid sequences 
inserted into the scaffold, or as a homooligomer. in which the sequences inserted into the 
scaffold are the same. 



If the oligomer according to the invention is a heterooligomer, it may be configured such 
that the polypeptides juxtaposed thereon have complementary biological activities. For 



example, two enzymes which act on the same substrate in succession are advantageously 
displayed ori the same scaffold, enabling them to act in concert. 

The monomers which constitute the oligomer may be covalently crosslinked to each 
other. Cross linking may be performed by recombinant approaches, such that the 
monomers are expressed ab initio as an oligomer; alternatively, cross-Unking may be 
performed at cys residues in the scaffold. For example, unique cys residues inserted 
between positions 56 and 57 of the GroES scaffold, or equivalent positions on other 
members of the cpnl 0 family, may be used to cross-link scaffold subunits. 

The nature of the heterologous polypeptide inserted into the scaffold subunit in 
accordance with the present invention may be selected at will. Examples of possible 
applications of the technology of the invention are set forth below; however, it will be 
apparent to the person skilled in the art that many different applications of the invention 
can be envisaged and, with the benefit of the present disclosure, put into practice in a 
straight forward maimer. 

Particularly advantageous embodiments of the invention include proteins which display 
antibodies, particularly fragments thereof such as scFv, Vh domains and Vh CDR3 
fragments; antigens, for example for vaccination; and polypeptides which have a 
biological activity, such as enzjones. 

In a further aspect, the present invention relates to a method for preparing a polypeptide 
monomer capable of oligomerisation according to the first aspect of the invention, 
comprising the steps of inserting a nucleic acid sequence encoding a heterologous 
polypeptide into a nucleic acid sequence encoding a subunit of an oligomerisable protein 
scaffold, incorporating the resulting nucleic acid into an expression vector, and 
expressing the nucleic acid to produce the polypeptide monomers. 

The invention moreover relates to a method for producing a polypeptide oligomer 
according to the second aspect of the invention, comprising allowing the polypeptide 
monomers produced as above to associate into an oligomer. Preferably, the monomers 
are cross-linked to form the oligomer. 
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Brief Description of the Drawings 

Figure 1. (a) Three-dimensional structure of Gp31 of bacteriophage T4 solved at 2.3 A. 
Positions mentioned in the text are indicated (residues numbered as in van der Vies, S., 
Gatenby, A. & Georgopoulos, C. (1994) Nature 368, 654-656). (b) Three-dimensional 
structure of minichaperone GroEL(191-376) solved at 1.7 A. The distance between 
residues 25 and 43 of GpSl is around 12 A; the distance between residues 191 and 376 of 
GroEL is around 9 A. Positions mentioned in the text are indicated (residues numbered as 
inHemmingsen, S. M., Woolford, C. van, d. V. S., Tilly, K., Dennis, D. T., 
Georgopoulos, C. P.. Hendrix, R. W. & Ellis, R. J. (1988) Nature 333, 330-334.). 
Secondary structure representations are drawn with MolScript (Kraulis, P. (1991) J. Appl. 
Crystallogr. 24, 946-950). 

Figure 2. Schematic representation of Gp31 proteins in the vectors used in this study. The 
presence of the Gp31 mobile loop (residues 23 to 44) and/or minichaperone GroEL 
(residues 191 to 376) are indicated by boxes. The nucleotide sequence of the Gp31 
mobile loop and relevant restriction sites are shown. The names of the corresponding 
vector are listed in the left margin. 

Figure 3. (a) Molecular weight determination by analytical gel filtration chromatography. 
Wild-type proteins Gp31 {M, ^1 x 12 kDa) and GroEL(191-376) (M, =22 kDa) and, 
Gp31Aloop and Gp3 1 A::GroEL(191-376) (MC7) were run on a Superdex™ 2OO (HR 
10/30) column (Pharmacia Biotech.) calibrated with molecular weight standards (solid- 
line and circles). Gp31Aloop and MC, eluted at volumes corresponding to molecular 
weights of ==145.6 and ==215 kDa, respectively, (b) Molecular weight determination of 
MC7by equilibrium analytical ultracentrifugation. The apparent molecular weight of MC7 
is =215 kDa. 



30 Figure 4. Characterization of MCv by CD spectroscopy, (a) Far UV-CD spectrum at 25 
°C. (b) Thermal denaturation followed at 222 nm at a heating rate of 1 "Cmin"'. 
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Figure 5. (a) Binding specificity of MC7 to GroES determined by ELISA. (b) Inhibition 
of MC7 binding to heptameric co-chaperonin GroES by varying concentrations of 
synthetic peptide corresponding to residues 16 to 32 of GroES mobile loop determined by 
competition ELISA. 



Figure 6. Binding avidity of MC7 to anti-GroEL antibodies determined (a) by direct 
ELISA or (b) by indirect ELISA. 



Figure 7. In vitro refolding of heat- and dithiothreitol-denatured mtMDH. (a) Protection 
of aggregation at 47 °C followed by ligth scattering at 550 nm. (b) Time-dependent 
reactivation of mtMDH at 25 °C. (c) Yields of mtMDH reactivation. 

Figure 8 and Figure 9 show the possible insertion sites for heterologous polypeptides in 
bacteriophege T4 Gp3 1 . 

Figure 10 illustrates the potential attachment sites for polypeptides to a circular scaffold, 
in this case Gp3 1 . 



Figure 11 shows a number of applications of scaffolded polypeptides, including 
oligomerisation of antibody binding domains, optionally including a label such as GFP, 
and potentially purification tags. 

Figure 12 illustrates fiirther applications for scaffolded polypeptides, including the 
formation of heterooligomers having a plurality of different functionalities, and the use of 
a circularly permuted subunit. 



Detailed Description of the Invention 
Definitions 



Oligomerisable scaffold. An oligomerisable scaffold, as referred to herein, is a 
polypeptide which is capable of oligomerising to form a scaffold and to which a 
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heterologous polypeptide may be fused, preferably covalently, without abolishing the 
oligomerisation capabilities. Thus, it provides a "scaffold" using which polypeptides may 
be arranged into multimers in accordance with the present invention. Optionally, parts of 
the wild-type polypeptide from which the scaffold is derived may be removed, for 
5 example by replacement with the heterologous polypeptide which is to be presented on 
the scaffold. 

Monomer. Monomers according to the present invention are polypeptides which 
possess the potential to oligomerise. This is brought about by the incorporation, in the 
10 polypeptide, of an oligomerisable scaffold subunit which will oligomerise with further 
scaffold subunits if combined therewith. 

Oligomer, As used herein, "oligomer" is synonymous with "polymer" or "multimer" 
and is used to indicate that the object in question is not monomeric. Thus, oligomeric 
15 polypeptides according to the invention comprise at least two monomeric units joined 

together covalently or non-covalently. The number of monomeric units employed will 
depend on the intended use of the oligomer, and may be between 2 and 20 or more. 
Advantageously, it is between 5 and 10, and preferably about 7. 

20 Polypeptide. As used herein, a polypeptide is a molecule comprising at lest one peptide 
bond linking two amino acids. This term is synonymous with "protein" and "peptide", 
both of which are used in the art to describe such molecules. A polypeptide may 
comprise other, non-amino acid components. A heterologous polypeptide is a 
polypeptide which is heterologous to the protein scaffold used in the invention. In other 

25 words, it is not part of the same molecule in nature. It may be derived from the same 
organism. Examples of polypeptides include those used for medical or biotechnological 
use, such as interleukins, interferons, antibodies and their fragments, insulin, transforming 
growth factor, antigens, immunogens and many toxins and proteases. 

30 Description of Preferred Embodiments 



Scaffold Proteins 
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In a preferred embodiment, the scaffold polypeptide is based on members of the 
cpnlO/HsplO family, such as GroES or an analogue thereof A highly preferred analogue 
is the T4 polypeptide Gp31. GroES analogues, including Gp31, possess a mobile loop 
(Hunt, J. F., et al, (1997) Cell 90, 361-371; Landry, S. J., et al, (1996) Proc. Natl. 
5 Acad.Sci. U.S.A. 93, 11622-1 1627) which may be inserted into, or replaced, in order to 
fuse the heterologous polypeptide to the scaffold. 

CpnlO homologues are widespread throughout animals, plants and bacteria. For example, 
a search of GenBank indicates that cpnlO homologues are known in the following 
10 species: 

Actinobacillus actinomycetemcomitans; Actinobacillus pleuropneumoniae; Aeromonas 
salmonicida: Agrobacterium tumefaciens; Allochromatium vinosum; Amoeba proteus 
symbiotic bacterium; Aquifex aeolicus; Arabidopsis thaliana; Bacillus sp; Bacillus 
stearothermophilus; Bacillus subtilis; Bartonella henselae; Bordetella pertussis; Borrelia 
burgdorferi; Brucella abortus; Buchnera aphidicola; Burkholderia cepacia; 
Burkholderia vietnamiensis; Campylobacter jejuni; Caulobacter crescentus; Chlamydia 
muridarum; Chlamydia trachomatis; Chlamydophila pneumoniae; Clostridium 
acetobutylicum; Clostridium perfringens; Clostridium thermocellum; coliphage T; 
Cowdria ruminantium; Cyanelle Cyanophora paradoxa; Ehrlichia canis; Ehrlichia 
chaffeensis; Ehrlichia equi; Ehrlichia phagocytophila; Ehrlichia risticii; Ehrlichia 
sennetsu; Ehrlichia sp 'HGE agent'; Enterobacter aerogenes; Enterobacter agglomerans; 
Enterobacter amnigenus; Enterobacter asburiae; Enterobacter gergoviae; Enterobacter 
intermedius; Erwinia aphidicola; Erwinia carotovora; Erwinia herbicola; Escherichia ■ 
coli; Francisella tularensis; Glycine max; Haemophilus ducreyi; Haemophilus influenzae 
Rd; Helicobacter pylori ; Holospora obtusa; Homo sapiens; Klebsiella ornithinolytica; 
Klebsiella oxytoca; Klebsiella planticola; Klebsiella pneumoniae; Lactobacillus 
helveticus; Lactobacillus zeae; Lactococcus lactis; Lawsonia intracellularis; Leptospira 
interrogans; Methylovorus sp strain SS; Mycobacterium avium; Mycobacterium avium 
subsp avium; Mycobacterium avium subsp paratuberculosis; Mycobacterium leprae; 
Mycobacterium tuberculosis; Mycoplasma genitalium; Mycoplasma pneumoniae; Myzus 
persicae primary endosymbiont; Neisseria gonorrhoeae; Oscillatoria sp NKBG; Pantoea 
ananas; Pasteurella multocida; Porphyromonas gingivalis; Pseudomonas aeruginosa; 
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Pseudomonas aeruginosa; Pseudomonas putida; Rattus norvegicus; Rattus norvegicus; 
Rhizobium leguminosarum: Rhodobacter capsulatus: Rhodobacter sphaeroides; 
Rhodothermus marinus; Rickettsia prowazekii; Rickettsia rickettsii; Saccharomyces 
cerevisiae; Serratia ficaria; Serratia marcescens; Serratia rubidaea; Sinorhizobium 
meliloti; Sitophilus oryzae principal endosymbiont; Stenotrophomonas maltophilia; 
Streptococcus pneumoniae; Streptomyces albus; Streptomyces coelicolor; Streptomyces 
coelicolor; Streptomyces lividans; Synechococcus sp; Synechococcus vulcanus; 
Synechocystis sp; Thermoanaerobacter brockii; Thermotoga maritima; Thermus 
aquaticus; Treponema pallidum; Wolbachia sp; Zymomonas mobilis. 

An advantage of cpnlO family subunits is that they possess a mobile loop, responsible for 
the protein folding activity of the natural chaperonin, which may be removed without 
affecting the scaffold. 

CpnlO with a deleted mobile loop possesses no biological activity, making it an 
advantageously inert scaffold, thus minimising any potentially deleterious effects. 
Insertion of an appropriate biologically active polypeptide can confer a biological activity 
on the novel polypeptide thus generated. Indeed, the biological activity of the inserted 
polypeptide may be improved by incorporation of the biologically active polypeptide into 
the scaffold. 



Alternative sites for peptide insertion are possible. An advantageous option is in the 
position equivalent to the roof beta hairpin in GroES. This involves replacement of Glu- 
60 m Gp31 by the desired peptide. The amino acid sequence is Pro(59)-Glu(60)-Gly(61). 
This is conveniently converted to a Smal site at the DNA level (CCC.GGG) encoding 
Pro-Gly, leaving a blunt-ended restriction site for peptide insertion as a DNA fragment. 
Similarly, an insertion may be made at between positions 56 and 57 of the GroES 
sequence, and at equivalent positions in other cpnlO family members. Alternatively, 
inverse PGR may be used, to display the peptide on the opposite side of the scaffold. 



Members of the cpn60/Hsp60 family of chaperonin molecules may also be used as 
scaffolds. For example, the tetradecameric bacterial chaperonin GroEL may be used. 
Advantageously, heterologous polypeptides would be inserted between positions 191 and 
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376, in particular between positions 197 and 333 (represented by SacII engineered and 
unique Cla I sites) to maintain intact the hinge region between the equatorial and the 
apical domains in order to impart mobility to the inserted polypeptide. The choice of 
scaffold may depend upon the intended application of the oHgomer: for example, if the 
oligomer is intended for vccination purposes (see below), the use of an immunogenic 
scaffold, such as that derived from Mycobacterium tuberculosis, is highly advantageous 
and confers an adjuvant effect. 

Mutants of cpn60 molecules may also be used. For example, the single ring mutant of 
GroEL (GroELSRl) contains four point mutations which effect the major attachment 
between the two rings of GroEL (R452E, E461 A, S463A and V464A) and is functionally 
inactive in vitro because it is release to bind GroES. GroELSR2 has an additional 
mutation at Glul91-Gly, which restores activity by reducing the affinity for GroES. Both 
of these mutants for ring structures and would be suitable for use as scaffolds. 

Polypeptides 

Any polypeptide may be incorporated into the structure of the monomers or oligomers as 
described above. The following classes of polypeptide are preferred, but the invention is 
not limited thereto. 

Immunogens Immunogenic peptides, capable of raising an immune response when 
exposed to the immune system of an organism, are preferred polypeptides for insertion 
into monomers and oligomers according to the invention. This aspect of the invention . 
has many applications, not only in vaccination but also in research. For example, the 
generation of human gene sequence data by the human genome project has made the 
generation of antisera reactive to new polypeptides a pressing requirement. The same 
requirement applies to prokaryotic, such as bacterial, and other eukaryotic, including 
fungal, gene products. Incorporation of more than one polypeptide immunogen into a 
scaffold increases the efficiency of the immunogens, due to increased avidity for 
immunoglobulin molecules. 
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The present invention has many advantages in the generation of an immune response. 
For example, the use of oligomers can permit the presentation of a number of antigens, 
simultaneously, to the immune system. This allows the preparation of polyvalent 
vaccines, capable of raising an immune response to more than one epitope, which may be 
present on a smgle organism or a number of different organisms. Thus, vaccines 
according to the invention may be used for simultaneous vaccination against more than 
one disease, or to target simultaneously a plurality of epitopes on a given pathogen. A 
preferred group of antigenic polypeptides is the V3 loops of various HIV subgroups, 
which can be immunised against simultaneously by the method of the present invention. 

Display of the V3 loop of the envelope glycoprotein gpl20 of HIV-1 (and -2) on a 
polypeptide scaffold is highly advantageous: 

1 . The production of a series of variant loops allows both sensitive detection of anti- 
HIV antibodies and simultaneous typing of the infecting subgroup of HIV on an 
array of loops. 

2. As an antigen for eliciting polyclonal or monoclonal antibodies, these scaffolded 
loops provide very specific epitopes for immunisation and vaccination . 

3. The scaffolded loops can be developed further to provide a screening assay of 
very high throughput to detect which are potential antiviral agents. 



The V3 loop of HIV-1 gpl20 is the major (but not exclusive) determinant of viral 
tropism. A substantial body of literature demonstrates that mitial binding of CD4 (the 
primary HIV receptor) to gpl20 alters the conformation of the latter, exposing the V3 
loop which binds then to one of a number of chemokine receptors on the same cell 
surface. The chemokine receptor (sometimes called the co-receptor) is usually CCR5 on 
macrophages and CXCR4 on T-cells, the two most important cell types infected by HIV. 
Dual tropic strains of HIV exist which can use either co-receptor, and consequently will 
infect both cell types. 
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Importantly, while V3 loops are highly variable (entire sections of the Los Alamos HIV 
database are devoted to recording the variability; see http://hiv-web.lanl.gov) the co- 
receptors, being host-encoded are not. Compounds which bind tightly to the host's 
chemokine receptors should therefore be capable of foiling viral entry. In fact the natural 
ligands for these receptors (RANTES, MlP-lalpha and MlP-lbeta for CCR5; SDF or 
Stem-cell derived factor for CXCR4) do just that. 

Scaffolded V3 loops with, for example GFP on the opposite side of the scaffold, act as 
surrogate labels; compounds may be screened in large numbers for their ability to 
displace binding of the scaffolded loops to the receptor (see J. Virol 1997 Volume 71, 
pages 6296-6304 where radio-labelled chemokines were used in such a displacement 
assay). Labelled chemokines provide useful controls for the specificity of the assay: they 
should displace the scaffold from the appropriate receptor. 

Structure/function studies can be carried out by mutagenesis of the loop (see, for 
example, EMBO Journal, 1997 Volume 16 pages 2599-2609). 

Moreover, the invention may be exploited by incorporating an adjuvant on the scaffold, 
together with the immunogen. Suitable adjuvants are, for example, bacterial toxins and 
cytokines, such as interleukins. The potency of the immunogen is thereby increased, 
allowing more efficient raising of antisera and more efficient immunisation. 

Preferably, in the context of immunisations, a bacterial or bacteriophage scaffold is used. 
Such scaffolds are unlikely to encounter endogenous host antibodies upon administration, ■ 
since naturally-occurring antibodies to these molecules are rare. 

The invention may be applied to the detection or the neutralisation of antibodies in vivo or 
in vitro. For example, in vitro, polyvalent or monovalent antigen-bearing scaffolds may 
be used to select antibody molecules derived from phage display experiments. Moreover, 
in vivo, antigen-bearing scaffolds according to the invention may be used to neutrahse 
antoantibodies in autoimmune disease, or to detect antibodies which may be indicative of 
pathological conditions, such as in HIV testing or other diagnostic applications. 
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Antibodies The affinity of antibodies or antibody fragments for antigens may be 
increased by oHgomerisation according to the present invention. Antibody fragments 
may be fragments such as Fv, Fab and F(ab')2 fragments or any derivatives thereof, such 
as a single chain Fv fragments. The antibodies or antibody fi-agments may be non- 
recombinant, recombinant or humanised. The antibody may be of any immunoglobulin 
isotype, e.g., IgG, IgM, and so forth. 

In a preferred aspect, the antibody fragments may be camelised Vh domains. It is known 
that the main intermolecular interactions between antibodies and their cognate antigens 
are mediated through Vh CDR3. However, VH-only antibodies, such as those derived 
from camel or llama (natrually VH.only single chain antibodies), have only low affinity 
for cognate antigen. 

The present invention provides for the oligomerisation of Vh domains, or Vh CDR3 
domains, to produce a high-affinity antibody. Two or more domains may be included in 
an oligomer according to the invention; in an oligomer based on a cpnlO scaffold, up to 7 
domains may be included, forming a hetpameric antibody molecule (heptabody). 
Advantageously, the antibody domains are arranged in a seven-membered ring formation, 
based on the cpnlO scaffold. 

Receptor ligands Many ligand-receptor pairs depend on dimerisation for activation 
of the receptor. Examples include the insulin and erythropoietin receptors. The fimction 
of the ligand is to dimerise the receptor, which leads to autophosphorylation and hence 
activation of the receptor. Whilst some ligands, such as substance P, are short 
polypeptides, others (including kinase and phosphatase substrates) are complex molecules 
which possess binding loops projecting from the surface thereof Short peptides or loops 
may be incorporated into oligomers according to the present invention to form a 
polyvalent receptor ligand or kinase/phosphatase substrate, useful for activating or 
inhibiting receptors and/or kinases at very low concentrations. 

Variation may be introduced into the heterologous polypeptides inserted into the scaffold 
m order to map the specificity of receptors or kinases/phosphatases for their 
ligands/substrates. Variants may be produced of the same loop, or a set of standard 
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different loops may be devised, in order to assess rapidly the specificity of a novel 
kinase/phosphatase. Variants may be produced by randomisation of sequences according 
to known techniques, such as PCR. They may be subjected to selection by a screening 
protocol, such as phage display, before incorporation into protein scaffolds in accordance 
with the invention. 



Enzymes Numerous biological reactions involve the sequential, and/or synergistic, 
action of a plurality of protein activities. Such protein activities may be incoiporated into 
a single molecule in accordance with the present invention. Preferably, therefore, the 
monomers which are used to compose the oligomer according to the invention 
incorporate amino acid sequences which encode distinct biological activities. The 
activities are advantageously complementary, such that they are required sequentially in a 
biological reaction, or act synergistically. The invention therefore provides plurifiintional 
macromolecular structures. 



In a preferred embodiment, the heterologous amino acid sequences are antibiotics. This 
provides an antibiotic molecule with any desired spectrum of activity. 

Configurations of Ohgomers according to the Invention 

Figures 8-12 show various topologies and applications for scaffolded polypeptides in 
accordance with the present invention. In figures 8 and 9, the possible insertion sites for 
heterologous polypeptides are shown. Insertion of polypeptides may be performed by 
any suitable technique, including those set forth by Doi and Yanagawa (FEES Letters 
(1999) 457:1-4). As set forth therein, insertion of polypeptides may be combined with 
randomisation to produce libraries of polypeptide repertoires, suitable for display and 
selection. 



Figure 10 illustrates the potential attachment sites for polypeptides to a circular scaffold, 
in this case Gp31 . Reading fi-om left to right, the figure shows: no attachment, attachment 
to the mobile loop, attachment to the roof p hairpin, attachment at both the mobile loop 
and the roof p hairpin, attachment at the C terminus, attachment at both N and C termini, 
attachment at both N and C termini and the mobile loop, and attachment at both N and C 
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termini, the roof |3 hairpin and the mobile loop. As will be apparent, further 
configurations are possible, and can be combined in any way in the heptamer, leading to a 
total of 5.4 X 10^ possible configurations. 

Figure 11 shows a number of applications of scaffolded polypeptides, including 
oligomerisation of antibody binding domains, optionally including, a label such as GFP, 
and potentially purification tags. Moreover, the scaffold can be sued as a basis for 
peptide libraries, which may be selected to identify a desired activity. 



Figure 12 illustrates further appHcations for scaffolded polypeptides, including the 
formation of heterooligomers having a plurality of different functionalities, and the use of 
a circularly permuted subunit which is incapable of assembly into a ring due to N and C 
terminus separation, to screen for possible binding pairs; polypeptides placed at the N and 
C termini will restore ring-forming abihty if they bind, and thus restore the function of a 
1 5 cpn 1 0 chaperonin. 

Recombinant DNA techniques 



The present invention advantageously makes use of recombinant DNA technology in 
order to construct polypeptide monomers and oligomers. Advantageously, polypeptide 
monomers or oligomers may be expressed from nucleic acid sequences which encode 



them. 



As used herein, vector (or plasmid) refers to discrete elements that are used to introduce 
heterologous DNA into cells for either expression or repUcation thereof Selection and 
use of such vehicles are well within the skill of the artisan. Many vectors are available, 
and selection of appropriate vector will depend on the intended use of the vector, i.e. 
whether it is to be used for DNA amplification or for DNA expression, the size of the 
DNA to be inserted into the vector, and the host ceil to be transformed with the vector. 
Each vector contains various components depending on its function (amplification of 
DNA or expression of DNA) and the host cell for which it is compatible. The vector 
components generally include, but are not limited to, one or more of the following: an 
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origin of replication, one or more marker genes, an enhancer element, a promoter, a 
transcription termination sequence and a signal sequence. 

Both expression and cloning vectors generally contain nucleic acid sequence that enable 
5 the vector to replicate in one or more selected host cells. Typically in cloning vectors, this 
sequence is one that enables the vector to replicate independently of the host 
chromosomal DNA, and includes origins of replication or autonomously rephcating 
sequences. Such sequences are well known for a variety of bacteria, yeast and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative 
10 bacteria, the 2m plasmid origin is suitable for yeast, and various viral origins (e.g. SV 40, 
polyoma, adenovirus) are useful for cloning vectors in mammalian cells. Generally, the 
origin of replication component is not needed for mammalian expression vectors unless 
these are used in mammalian cells competent for high level DNA replication, such as 
COS cells. 

15 

Most expression vectors are shuttle vectors, i.e. they are capable of replication in at least 
one class of organisms but can be transfected into another class of organisms for 
expression. For example, a vector is cloned in E. coli and then the same vector is 
transfected into yeast or mammalian cells even though it is not capable of replicating 
20 independently of the host cell chromosome. DNA can be amplified by PGR and be 
directly transfected into the host cells without any replication component. 

Advantageously, an expression and cloning vector may contain a selection gene also 
referred to as selectable marker. This gene encodes a protein necessary for the survival or 
25 growth of transformed host cells grown in a selective culture medium. Host cells not 
transformed with the vector containing the selection gene will not survive in the culture 
medium. Typical selection genes encode proteins that confer resistance to antibiotics and 
other toxins, e.g. ampicillin, neomycin, methotrexate or tetracycline, complement 
auxotrophic deficiencies, or supply critical nutrients not available from complex media. 

30 

As to a selective gene marker appropriate for yeast, any marker gene can be used which 
facilitates the selection for transformants due to the phenotypic expression of the marker 
gene. Suitable markers for yeast are, for example, those conferring resistance to 
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antibiotics G418, hygromycin or bleomycin, or provide for prototrophy in an auxotrophic 
yeast mutant, for example the URA3, LEU2, LYS2, TRPl, or HISS gene. 

Since the replication of vectors is conveniently done in E. coli, an E. coli genetic marker 
5 and an E. coli origin of replication are advantageously included. These can be obtained 
from E. coH plasmids, such as pBR322, Bluescript® vector or a pUC plasmid, e.g, pUC18 
or pUC19, which contain both E. coli replication origin and E. coh genetic marker 
conferring resistance to antibiotics, such as ampicillin. 

10 Suitable selectable markers for mammalian cells are those that enable the identification of 
cells which have been transformed, such as dihydrofolate reductase (DHFR, methotrexate 
resistance), thymidine kinase, or genes conferring resistance to G418 or hygromycin. The 
mammalian cell transformants are placed under selection pressure which only those 
transformants which have taken up and are expressing the marker are uniquely adapted to 

15 survive. In the case of a DHFR or glutamine synthase (GS) marker, selection pressure 
can be imposed by culturing the transformants under conditions in which the pressure is 
progressively increased, thereby leading to amplification (at its chromosomal integration 
site) of both the selection gene and the linked DNA that encodes the polypeptide 
according to the invention. Amplification is the process by which genes in greater 

20 demand for the production of a protein critical for growth, together with closely 
associated genes which may encode a desired protein, are reiterated in tandem within the 
chromosomes of recombinant cells. Increased quantities of desired protein are usually 
synthesised from thus amplified DNA, 

25 Expression and cloning vectors usually contain a promoter that is recognised by the host 
organism and is operably linked to the heterologous nucleic acid coding sequence. Such a 
promoter may be inducible or constitutive. The promoters are operably linked to the 
coding sequence by inserting the isolated promoter sequence into the vector. Many 
heterologous promoters may be used to direct amplification and/or expression of the 

30 coding sequence. The term "operably linked" refers to a juxtaposition wherein the 
components described are in a relationship permitting them to fiinction in their intended 
marmer. A control sequence "operably linked" to a coding sequence is ligated in such a 
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way that expression of the coding sequence is achieved under conditions compatible with 
the control sequences. 

Promoters suitable for use with prokaryotic hosts include, for example, the P-lactamase 
5 and lactose promoter systems, alkaline phosphatase, the tryptophan (trp) promoter system 
and hybrid promoters such as the tac promoter. Their nucleotide sequences have been 
published, thereby enabling the skilled worker operably to ligate them to the coding 
sequence, using linkers or adapters to supply any required restriction sites. Promoters for 
use in bacterial systems will also generally contain a Shine-Delgamo sequence operably 
10 linked to the coding sequence. 

Preferred expression vectors are bacterial expression vectors which comprise a promoter 
of a bacteriophage such as phagex or T7 which is capable of functioning in the bacteria. 
In one of the most widely used expression systems, the nucleic acid encoding the fusion 

15 protein may be transcribed from the vector by T7 RNA polymerase (Studier et al. 
Methods in Enzymol. 185; 60-89, 1990). In the E. coli BL21(DE3) host strain, used in 
conjunction with pET vectors, the T7 RNA polymerase is produced from the >L-lysogen 
DE3 in the host bacterium, and its expression is under the control of the IPTG inducible 
lac UV5 promoter. This system has been employed successfully for over-production of 

20 many proteins. Altematively the polymerase gene may be introduced on a lambda phage 
by infection with an int- phage such as the CE6 phage which is commercially available 
(Novagen, Madison, USA), other vectors include vectors containing the lambda PL 
promoter such as PLEX (Invitrogen, NL) , vectors containing the trc promoters such as 
pTrcHisXpressTm (Invitrogen) or pTrc99 (Pharmacia Biotech, SE) , or vectors containing 

25 the tac promoter such as pKK223-3 (Phamiacia Biotech) or PMAL (new England 
Biolabs, MA, USA). 

Moreover, the coding sequence according to the invention preferably includes a secretion 
sequence in order to facilitate secretion of the polypeptide from bacterial hosts, such that 
30 it will be produced as a soluble native peptide rather than in an inclusion body. The 
peptide may be recovered from the bacterial periplasmic space, or the culture medium, as 
appropriate. 
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Suitable promoting sequences for use with yeast hosts may be regulated or constitutive 
and are preferably derived from a highly expressed yeast gene, especially a 
Saccharomyces cerevisiae gene. Thus, the promoter of the TRPl gene, the ADHI or 
ADHII gene, the acid phosphatase (PH05) gene, a promoter of the yeast mating 
5 pheromone genes coding for the a- or a-factor or a promoter derived from a gene 
encoding a glycolytic enzyme such as the promoter of the enolase, glyceraldehyde-3- 
phosphate dehydrogenase (GAP), 3-phospho glycerate kinase (PGK), hexokinase, 
pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triose phosphate isomerase, phosphoglucose 

10 isomerase or glucokinase genes, the S. cerevisiae GAL 4 gene, the S. pombe nmt 1 gene 
or a promoter from the TATA binding protein (TBP) gene can be used. Furthermore, it is 
possible to use hybrid promoters comprising upstream activation sequences (UAS) of one 
yeast gene and downstream promoter elements including a functional TATA box of 
another yeast gene, for example a hybrid promoter including the UAS(s) of the yeast 

15 PH05 gene and downstream promoter elements including a functional TATA box of the 
yeast GAP gene (PH05-GAP hybrid promoter). A suitable constitutive PH05 promoter is 
e.g. a shortened acid phosphatase PH05 promoter devoid of the upstream regulatory 
elements (UAS) such as the PH05 (-173) promoter element starting at nucleotide -173 
and ending at nucleotide -9 of the PH05 gene. 

20 

Transcription from vectors in mammalian hosts may be controlled by promoters derived 
from the genomes of viruses such as polyoma virus, adenovirus, fowlpox virus, bovine 
papilloma virus, avian sarcoma virus, cytomegalovirus (CMV), a retrovirus and Simian 
Virus 40 (SV40), from heterologous mammalian promoters such as the actin promoter or 
25 a very strong promoter, e.g. a ribosomal protein promoter, provided such promoters are 
compatible with the host cell systems. 

Transcription of a coding sequence by higher eukaryotes may be increased by inserting an 
enhancer sequence into the vector. Enhancers are relatively orientation and position 
30 independent. Many enhancer sequences are known from mammahan genes (e.g. elastase 
and globin). However, typically one will employ an enhancer from a eukaryotic cell virus. 
Examples include the SV40 enhancer on the late side of the replication origin (bp 100- 
270) and the CMV early promoter enhancer. The enhancer may be spliced into the vector 
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at a position 5' or 3* to the coding sequence, but is preferably located at a site 5' from the 
promoter. 

Advantageously, a eukaryotic expression vector may comprise a locus control region 
(LCR). LCRs are capable of directing high-level integration site independent expression 
of transgenes integrated into host cell chromatin, which is of importance especially where 
the coding sequence is to be expressed in the context of a permanently-transfected 
eukaryotic cell line in which chromosomal integration of the vector has occurred, in 
vectors designed for gene therapy apphcations or in transgenic animals. 

Eukaryotic expression vectors will also contain sequences necessary for the termination 
of transcription and for stabilising the mRNA. Such sequences are commonly available 
from the 5* and 3' untranslated regions of eukaryotic or viral DNAs or cDNAs. These 
regions contain nucleotide segments transcribed as polyadenylated fragments in the 
untranslated portion of the mRNA. 

An expression vector includes any vector capable of expressing nucleic acids that are 
operatively linked with regulatory sequences, such as promoter regions, that are capable 
of expression of such DNAs. Thus, an expression vector refers to a recombinant DNA or 
RNA construct, such as a plasmid, a phage, recombinant virus or other vector, that upon 
introduction into an appropriate host cell, results in expression of the cloned DNA. 
Appropriate expression vectors are well known to those with ordinary skill in the art and 
include those that are replicable in eukaryotic and/or prokaryotic cells and those that 
remain episomal or those which integrate into the host cell genome. For example, nucleic 
acids may be inserted into a vector suitable for expression of cDNAs in mammalian cells, 
e.g. a CMV enhancer-based vector such as pEVRF (Matthias, et aL, (1989) NAR 17, 
6418). 

Construction of vectors according to the invention employs conventional hgation 
techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the 
form desired to generate the plasmids required. If desired, analysis to confirm correct 
sequences in the constructed plasmids is performed in a known fashion. Suitable methods 
for constructing expression vectors, preparing in vitro transcripts, introducing DNA into 
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host cells, and performing analyses for assessing expression and function are known to 
those skilled in the art. Gene presence, amplification and/or expression may be measured 
in a sample directly, for example, by conventional Southern blotting, Northern blotting to 
quantitate the transcription of mRNA, dot blotting (DNA or RNA analysis), or in situ 
5 hybridisation, using an appropriately labelled probe which may be based on a sequence 
provided herein. Those skilled in the art will readily envisage how these methods may be 
modified, if desired. 

The invention also envisages the administration of polypeptide oligomers according to the 
10 invention as compositions, preferably for the treatment of diseases associated with protein 
misfolding. The active compound may be administered in a convenient manner such as 
by the oral, intravenous (where water soluble), intramuscular, subcutaneous, intranasal, 
intradermal or suppository routes or implanting (e.g. using slow release molecules). 
Depending on the route of administration, the active ingredient may be required to be 
15 coated in a material to protect said ingredients fi-om the action of enzymes, acids and 
other natural conditions which may inactivate said ingredient. 

In order to administer the combination by other than parenteral administration, it will be 
coated by, or administered with, a material to prevent its inactivation. For example, the 
20 combination may be administered in an adjuvant, co-administered with enzyme inhibitors 
or in liposomes. Adjuvant is used in its broadest sense and includes any immune 
stimulating compound such as interferon. Adjuvants contemplated herein include 
resorcinols, non-ionic surfactants such as polyoxyethylene oleyl ether and n-hexadecyl 
polyethylene ether. Enzyme inhibitors include pancreatic trypsin. 

25 

Liposomes include water-in-oil-in-water CGF emulsions as well as conventional 
liposomes. 

The active compound may also be administered parenterally or intraperitoneally. 
30 Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures 
thereof and in oils. Under ordinary conditions of storage and use, these preparations 
contain a preservative to prevent the growth of microorganisms. 
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The pharmaceutical forms suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. In all cases the form must be 
sterile and must be fluid to the extent that easy syringability exists. It must be stable 
5 under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can be a 
solvent or dispersion medium containing, for example, water, ethanol, polyol (for 
example, glycerol, propylene glycol, and liquid polyetheylene gloycol, and the like), 
suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for 
10 example, by the use of a coating such as lecithin, by the maintenance of the required 
particle size in the case of dispersion and by the use of superfactants. 

The prevention of the action of microorganisms can be brought about by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic 
15 acid, thirmerosal, and the like. In many cases, it will be preferable to include isotonic 
agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable 
compositions can be brought about by the use in the compositions of agents delaying 
absorption, for example, aluminium monostearate and gelatin. 

20 Sterile injectable solutions are prepared by incorporating the active compound in the 
required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilisation. Generally, dispersions 
are prepared by incorporating the sterilised active ingredient into a sterile vehicle which 
contains the basic dispersion medium and the required other ingredients from those 

25 enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, the preferred methods of preparation are vacuum drying and the freeze-drying 
technique which yield a powder of the active ingredient plus any additional desired 
ingredient from previously sterile-filtered solution thereof. 

30 When the combination of polypeptides is suitably protected as described above, it may be 
orally administered, for example, with an inert diluent or with an assimilable edible 
carrier, or it may be enclosed in hard or soft shell gelatin capsules, or it may be 
compressed into tablets, or it may be incorporated directly with the food of the diet. For 
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oral therapeutic administration, the active compound may be incorporated with excipients 
and used in the form of ingestible tablets, buccal tablets, troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like. The amount of active compound in such 
therapeutically useful compositions in such that a suitable dosage will be obtained. 

The tablets, troches, pills, capsules and the like may also contain the following: a binder 
such as gum tragacanth, acacia, com starch or gelatin; excipients such as dicalcium 
phosphate; a disintegrating agent such as com starch, potato starch, alginic acid and the 
like; a lubricant such as magnesium stearate; and a sweetening agent such as sucrose, 
lactose or saccharin may be added or a flavouring agent such as peppermint, oil of 
wintergreen, or cherry flavouring. When the dosage unit form is a capsule, it may 
contain, in addition to materials of the above type, a liquid carrier. 

Various other materials may be present as coatings or to otherwise modify the physical 
form of the dosage unit. For instance, tablets, pills, or capsules may be coated with 
shellac, sugar or both. A syrup or elixir may contain the active compound, sucrose as a 
sweetening agent, methyl and propylparabens as preservatives, a dye and flavouring such 
as cherry or orange flavour. Of course, any material used in preparing any dosage unit 
form should be pharmaceutically pure and substantially non-toxic in the amounts 
employed. In addition, the active compound may be incorporated into sustained-release 
preparations and formulations. 



As used herein "pharmaceutically acceptable carrier and/or diluent" includes any and all 
solvents, dispersion media, coatings, antibacterial and antiflingal agents, isotonic and 
absorption delaying agents and the like. The use of such media and agents for 
pharmaceutical active substances is well known in the art. Except insofar as any 
conventional media or agent is incompatible with the active ingredient, use thereof in the 
therapeutic compositions is contemplated. Supplementary active ingredients can also be 
incorporated into the compositions. 

It is especially advantageous to formulate parenteral compositions in dosage unit form for 
ease of administration and uniformity of dosage. Dosage unit form as used herein refers 
to physically discrete units suited as unitary dosages for the mammalian subjects to be 
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treated; each unit containing a predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the novel dosage unit forms of the invention are dictated by 
and directly dependent on (a) the unique characteristics of the active material and the 
particular therapeutic effect to be achieved, and (b) the hmitations inherent in the art of 
compounding such as active material for the treatment of disease in living subjects having 
a diseased condition in which bodily health is impaired. 

The principal active ingredients are compounded for convenient and effective 
administration in effective amounts with a suitable pharmaceutically acceptable carrier in 
dosage unit form. In the case of compositions containing supplementary active 
ingredients, the dosages are determined by reference to the usual dose and manner of 
administration of the said ingredients. 

In a further aspect there is provided the combination of the invention as hereinbefore 
defined for use in the treatment of disease. Consequently there is provided the use of a 
combination of the invention for the manufacture of a medicament for the treatment of 
disease associated with aberrant protein/polypeptide structure. The aberrant nature of the 
protein/polypeptide may be due to misfolding or unfolding which in turn may be due to 
an anomalous e.g. mutated amino acid sequence. The protein/polypeptide may be 
destabilised or deposited as plaques e.g. as in Alzheimer's disease. The disease might be 
caused by a prion. A polypeptide-based medicament of the invention would act to 
renature or resolubilise aberrant, defective or deposited proteins. 

The invention is further described below, for the purposes of illustration only, in the 
following examples. 

Examples 



1. General Experimental Procedures 
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Bacterial and bacteriophage strains. The E. coli strains used in this study were: 
C41(DE3), a mutant of BL21(DE3) capable of expressing toxic genes (Miroux, B. & 
Walker, J. E. (1996) J. Mol. Biol. 260, 289-298); SV2 {:Q\l%groEL44), SV3 
(B178gro£Z5P) and SV6 (B178gro£Z(575): isogenic strains carrying temperature- 
sensitive alleles of groEL; SV1(=B178) (Georgopoulos. C, Hendrix, R. W., Casjens, S. 
R. & Kaiser, A. D. (1973) J. Mol. Biol. 76, 45-60), AI90 (AgroEL::kan^) [pB AD-EL] 
(Ivic, A., Olden, D., Wallington, E. J. & Lund, P. A. (1997) Gene 194, 1-8), and TGI 
(Gibson, T. J. (1984) Ph.D. thesis. University of Cambridge, U.K). Bacteriophage X b2cl 
(Zeilstra-Ryalls, J., Fayet, O., Baird, L. & Georgopoulos, C. (1993) J. Bacteriol. 175, 
1134-1143) was used according to standard methods (Arber, W., Enquist, L., Hohn, B., 
Murray, N. E. & Murray, K. (1983) in Lambda II., ed. R. W. Hendrix, J. W. r., F. W. 
Stahl and R. A. Weisberg (Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.), 
pp. 433-466); plaque formation was assayed at 30 °C. T4D0, a derivative of bacteriophage 
T4 (Zeilstra-Ryalls, J., Fayet, O., Baird, L. & Georgopoulos, C. (1993) J. Bacteriol. 175, 
1134-1143), was used according to standard methods (Karam, J. D. (1994) Molecular 
biology of bacteriophage T4. (American Society for Microbiology, Washington, DC)); 
plaque formation was assayed at 37 °C. 

Plasmid constructions. Standard molecular biology procedures were used (Sambrook, J., 
Fritsch, E. F. & Maniatis, T. (1989) Molecular Cloning. A Laboratory Manual (Cold 
Spring Harbor Laboratory Press, N.Y.)). The schematic organisation of the plasmids used 
in this study is represented Figure 2. Gp3J gene was PGR (polymerase chain reaction) 
amplified using two oligonucleotides 5' - C TTC AGA CAT ATG TCT GAA GTA CAA 
CAG CTA CC - 3- and 5' - TAA CGG CCG TTA CTT ATA AAG ACA CGG AAT 
AGC - 3- producing a 358 bp DNA using pSV25 (van der Vies, S., Gatenby, A. & 
Georgopoulos, C. (1994) Nature 368, 654-656) as template. The DNA sequence of a part 
of the mobile loop of Gp31 (residues 25 to 43) was removed by PGR, as described 
(Hemsley, A., Amheim, N., Toney, M. D., Cortopassi, G. & Galas, D. J. (1989) Nucleic 
Acids Res. 17, 6545-6551), using oligonucleotides 5' - GGA GAA GTT CCT GAA CTG 
- 3' and 5' - GGA TCC GGC TTG TGC AGG TTC - 3', creating a unique BamH I site 
(bold characters). GroEL gene minichaperone (corresponding to the apical domain of 
GroEL, residues 191 to 376; (Zahn, R., Buckle, A. M., Ferret, S., Johnson, C. M. J., 
Corrales, F. J., Golbik, R. & Fersht, A. R. (1996) Proc. Natl. Acad. Sci. U.S.A. 93, 
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15024-15029)) was amplified by PGR using oligonucleotides, containing a BamH I site 
(underlined), 5' - TTC GGA TCC GAA GOT ATG CAG TTC GAG G - 3' and 5' - GTT 
^CG AAG GGG GGG TGG GAG TTT C - 3' and cloned into the unique BamH I 
site of pRSETA-Gp31Aloop vector, inserting minichaperone GroEL(191-376) in frame 
into Gp31Aloop sequence. The single ring GroELsRi mutant contains four amino acid 
substitutions (R452E, E461A, S463A, and V464A) into the equatorial interface of 
GroEL, which prevent the formation of double rings (Weissman, J. S., Hohl, C. M., 
Kovalenko, O., Kashi, Y., Chen, S., Braig, K., Saibil, H. R., Fenton, W. A. & Horwich, 
A. L. (1995) Cell 83, 577-587). The corresponding mutations were introduced into groEL 
by PGR (Hemsley, A., Amheim, N., Toney, M. D., Gortopassi, G. & Galas, D. J. (1989) 
Nucleic Acids Res. 17, 6545-6551) using oligonucleotides 5' - TGA GTA GGA TGT 
GTT GGA GGG GAG GTT GG - 3' and 5' - ATT GGG GGG AAG G GG GGG G TG 
GTG TTG GTA AGA GGG - 3' and pRSETA-£a^ I GroEL or GroESL vectors 
(Ghatellier, J., Hill, F., Lund, P. & Fersht, A. R. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 
9861-9866) as template; silent mutations, in respect to the codon usage in E. coli, create a 
uniques Mfe I (bold characters) and Nae I (underlined). Gro£Z:(E191G; groEL44 allele) 
gene was PGR amplified from E. coli SV2 strain (Zeilstra-Ryalls, J., Fayet, O., Baird, L. 
«& Georgopoulos, G. (1993) J. Bacteriol. 175, 1 134-1 143) using two oligonucleotides 5' - 
T AGG TGG GAT ATG GGA GGT AAA GAG GTA AAA TTC GG - 3' and 5' - ATG 
TAA CGG GGG TTA CAT CAT GGG GGG CAT GGG AGG - 3' producing a 1,659 bp 
DNA with unique sites for Nde I and Eag I (underlined). The different genes were 
subcloned into the unique Nde I and Eag I unique sites of pAGYG184, pJC and pBAD30 
(Guzman, L.-M., Belin, D., Carson, M. J. & Beckwith, J. (1995) J. Bacteriol. 177, 4121- 
4130) vectors (Ghatellier, J., Hill, F., Lund, P. & Fersht, A. R. (1998) Proc. Natl. Acad. 
Sci. U.S.A. 95, 9861-9866). A colony-based PGR procedure was used to identify the 
positive clones (Ghatellier, J., Mazza, A., Brousseau, R. & Vemet, T. (1995) Analyt. 
Biochem. 229, 282-290). PGR cycle sequencing using fluorescent dideoxy chain 
terminators (Applied Biosystems) were performed and analysed on an Applied 
Biosystems 373A Automated DNA. All PGR amplified DNA fragments were sequenced 
after cloning. 



Proteins expression purification and characterisation. The GroE proteins, -57.5 kDa 
GroEL and -10 kDa GroES, were expressed and purified as previously described 
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(Chatellier, J., Hill, F., Lund, P. & Fersht, A. R. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 
9861-9866; Corrales, F. J. & Fersht, A. R. (1996) Folding & Design 1, 265-273). 
GroELsRi mutant was expressed and purified using the same procedure used for the wild- 
type GroEL; GroELsRi mutant was separated from emdogenous wild-type GroEL by 
ammonium sulfate precipitation at 30% saturation. GroEL(E191G) protein was expressed 
by inducing the Pbad promoter of pBAD30 based vector with 0.2 % arabinose in E. coli 
SV2 strain (Zeilstra-Ryalls, J., Fayet, O., Baird, L. & Georgopoulos, C. (1993) J. 
Bacteriol. 175, 1134-1143). Purification was performed essentially as described 
(Corrales, F. J. & Fersht, A. R. (1996) Folding & Design 1, 265-273). Residual peptides 
bound to GroEL proteins were removed by ion-exchange chromatography on a MonoQ 
column (Pharmacia Biotech.) in presence of 25 % methanol. The over-expression of 
histidine-tagged (short histidine tail; sht)-minichaperone GroEL(191-376) in E. coli 
C41(DE3) cells and, the purification and the removal of sht by thrombin cleavage were 
carried out essentially as previously described (Zahn, R., Buckle, A. M., Perret, S., 
Johnson, C. M. J., Corrales, F. J., Golbik, R. & Fersht. A. R. (1996) Proc. Natl. Acad. Sci. 
U.S.A. 93, 15024-15029). Gp31 proteins wild-type (-12 kDa), Aloop (-10.4 kDa) and 
MC, (-30.6 kDa), were expressed by inducing the T7 promoter ofpRSETA-Eag I based 
vectors with isopropyl->5-D-thiogalactoside (IPTG) in E. coli C41(DE3) (Miroux, B. & 
Walker, J. E. (1996) J. Mol. Biol. 260, 289-298) overnight at 25 °C. Purification 
procedures were essentially as described (van der Vies, S., Gatenby, A. 8c Georgopoulos, 
C. (1994) Nature 368, 654-656; Castillo, C. J. & Black, L. W. (1978) J. Biol. Chem. 253, 
2132-2139). Ammonium sulfate precipitation (only 20% saturation for Aloop; 30 to 70% 
saturation for wild-type and MC7) was followed by ion-exchange chromatography on a 
DEAE-Sepharose column (Pharmacia Biotech.). Gp31 proteins were eluted with a 0-0.5 
M NaCl gradient in 20 mM Tris-HCl, 1 mM EDTA, 1 mM ;5^mercaptoethanol, pH 7.5; 
Aloop and MC7 eluted between 0.32-0.44 and 0.38-0.48 mM NaCl, respectively. Gp31 
proteins were further purified by gel filtration chromatography on a Superdex^M 200 
(Hiload 26/10) column (Pharmacia Biotech.) equilibrated with 100 mM Tris-HCl, pH 7.5 
and, dialysed against and stored in 50 mM Tris-HCl, 0.1 mM EDTA, 1 mM fi- 
mercaptoethanol, pH 7.5. Proteins were analysed by electrospray mass spectrometry. 
Protein concentration was determined by absorbance at 276 nm using the method of Gill 
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& von Hippel (Gill, S. C. & von Hippel, P. H. (1989) Analyt. Biochem. 182, 319-326) 
and confirmed by quantitative amino acid analysis. 

Constitutive expression under the control of the tetracycline-resistance gene promoter / 
operator was obtained using the high copy-number pJC vectors (Chatellier, J., Hill, F., 
Lund, P. & Fersht, A. R. (1998) Proc. Natl. Acad. Sci, U.S.A. 95, 9861-9866). pBAD30 
vector allows inducible expression with 0.2-0.5 % arabinose controlled by the I^bad 
promoter and its regulatory gene, araC (Guzman, L.-M., Belin, D., Carson, M. J. & 
Beckwith, J. (1995) J. Bacteriol. 177, 4121-4130). The level of expression of MCy was 
analysed by 15% sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS- 
PAGE) under non-reducing conditions followed by Western blotting as described 
(Chatellier, J., Hill, F., Lund, P. & Fersht, A. R. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 
9861-9866). 

Molecular weight determination by analytical gel filtration chromatography and 
analytical ultracentrifugation. One hundred ^il aliquots of protein (1 mg.mL"^) were 
loaded onto a Superdex™ 200 (HR 10/30) column (Pharmacia Biotech.) equilibrated with 
50 mM Tris-HCl, 150 mM NaCl, pH 7.5 at 0.5 mL.min"^ at 20 °C. The column was 
calibrated using gel j&ltration standards from Pharmacia Biotech, (thyroglobulin, 
MW=669 kDa; ferritin, MW=440 kDa; aldolase, MW=158 kDa; ovalbulmin, MW=45 
kDa; chymotrypsinogen MW=25 kDa; RNase, MW=13 kDa). Molecular weights were 
determined by logarithmic interpolation. 

Sedimentation analysis was performed in 50 mM Tris-HCl, 2.5 mM DTE (dithio- 
erythritol), pH 7.2 at 20 ""C with protein concentration in the range 45-300 ^iM, scanning 
at 280 rmi, with a Beckman XL-A analytical ultracentrifuge, using an An-60Ti rotor. 
Sedimentation equilibrium experiments were at 10,000 rev.min'^ with overspeeding at 
15,000 rev.min'^ for 6 hours to speed the attainment of equilibrium. Scans were taken at 
intervals of 24 hours, until successive scans superimposed exactly, when the later scan 
was taken as being operationally at equilibrium. To evaluate the apparent average 
molecular weight, data were fitted by non-linear regression. 

Circular dichroism spectroscopy (CD). Far UV (200-250 nm)-CD spectra at 25 °C 
were measured on a Jasco J720 spectropolarimeter interfaced with a Neslab PTC-348WI 
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water bath, using a thennostatted cuvette of 0.1 cm path length. Spectra are averages of 
10 scans and were recorded with a sampling interval of 0.1 nm. Thermal denaturation was 
carried out from 5-95 °C at a linear rate of 1 °C.min"' and monitored at 222 nm. The 
reversibility was checked after incubation at 95 °C for 20 min and cooling to and 
equilibration at 5 °C. The protein concentration was 45 ^iM in 10 mM sodium phosphate 
buffer pH 7.8, 2.5 mM DTE (dithioerythrol). 

GroES binding and competition assays by ELISA (enzyme-linked immunosorbant 
assay). Proteins were coated onto plastic microtitre plates (Maxisorb, Nunc) overnight at 
4 °C at a concentration of 10 ^ig/mL in carbonate buffer (50 mM NaHCOs, pH 9.6). 
Plates were blocked for 1 hour at 25 °C with 2% Marvel in PBS (phosphate buffered 
saline: 25 mM NaH2P04, 125 mM NaCl, pH 7.0). GroES. at 10 ^ig/mL in 100 ^iL of 10 
mM Tris-HCl, 200 mM KGl, pH 7.4, were bound at 25 °C for 1 hour. Bound GroES were 
detected with rabbit anti-GroES antibodies (Sigma) followed by anti-rabbit 
immunoglobulins horseradish peroxidase conjugated antibodies (Sigma). 

A peptide corresponding to the mobile loop of GroES (residues 16 to 32, numbered as in 
Hemmingsen, S. M., Woolford, C, van, d. V. S., Tilly, K., Dennis, D. T., Georgopoulos, 
C. P., Hendrix, R. W. & Ellis, R. J. (1988) Nature 333, 330-334) was synthesized as 
described (Chatellier, J., Buckle, A. M. & Fersht, A. R. (1999) J. Mol. Biol., in press). 
The inhibition of the binding of MC7 proteins by the free peptide was analysed by ELISA, 
essentially as above, by adding different concentrations (between 10,000 to 0.1 fiM) of 
free peptide solved in 0.1% TEA solution to 1 ^g of proteins prior incubation to coated 
GroES proteins (10 |ag/mL). GroEL molecules were detected with rabbit anti-GroEL 
antibodies (Sigma) followed by anti-rabbit immunoglobulins horseradish peroxidase 
conjugate antibodies (Sigma). ELISAs were developed with 3',3',5',5'- 
tetramethylbenzidine (TMB, Boehringer Mannheim). Reactions were stopped with 50 ^1 
of IM H2SO4 after 10 min and readings taken by subtracting the O.D.650 nm from the 
OD.450 nm- 

Anti-GroEL antibodies binding by ELISA. The same amount of proteins (1 )ig) were 
coated as described above. GroEL molecules were detected with either (/) rabbit anti- 
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GroEL horseradish peroxidase conjugate antibodies (9 mg/mL; Sigma) or (ii) rabbit anti- 
GroEL antibodies (11.5 mg/mL; Sigma) followed by anti-rabbit immunoglobulins 
horseradish peroxidase conjugate antibodies (Sigma). ELISAs were developed as 
described above. 



In vitro refolding experiments. Refolding assays of pig heart mitochondrial malate 
dehydrogenase (mtMDH; Boehringer-Mannheim) and aggregation protection were 
carried out essentially as described (Peres Ben-Zvi, A. P., Chatellier, J., Fersht, A. R. & 
Goloubinoff, P. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 15275-15280). The 
concentrations of MC7 used were between 8-16 |j.M (reporter to protomer). 

In vivo complementation experiments. Complementation experiments were performed 
by transforming electro-competent SV2 or SV6 cells with the pJC series of expression 
vectors and plating an aliquot of the transformation reactions directly at 43 °C. The 
percentage of viable cells relative to the growth at 30 °C was determined. A 
representative number of clones which grew at 43 °C were incubated in absence of any 
selective markers at permissive temperature. After prolonged growth the loss of the pJC 
plasmids and the ts phenotype were verified. Each experiment was performed in 
duplicate. Plasmids carrying no groE genes or encoding the GroE proteins were used as 
negative or positive controls, respectively. 

PI transduction (Miller, J. H. (1972) Experiments in Molecular Genetics (Cold Spring 
Harbor, N.Y.)), using strain AI90 (AgroEL::kan^) [pB AD-EL] as donor (Ivic, A., Olden, 
D., Wallington, E. J. & Lund, P. A. (1997) Gene 194, 1-8), was used to delete the groEL 
gene of TGI cells transfected by the different pJC vectors. Transductants were selected 
on LB plates containing 10 ^ig/mL of kanamycin at 37 °C. Approximately 25 colonies 
were transferred onto plates containing kanamycin at 50 jig/mL. After incubation for 24 h 
at 37 °C, colonies that grew were screened by PGR as described. 

AI90 iAgroEL::kan^) [pBAD-EL] cells were transformed with the pJC vector series. 
Transformants were selected at 37 °C on LB supplemented with 50 ^g.m^' of kanamycin, 
120 ng.mL'' of ampicillin, 25 jig.mL"' of chloramphenicol and 0.2% L(+)arabinose. 
Depletion of GroEL protein was analysed at 37 °C by plating the same quantity of AI90 
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[pBAD-EL + pJC vectors] cells on LB plates containing 1% D(+)glucose or 
amount of arabinose. 



Each experiment was performed in triplicate. Plasmids carrying no groE genes or 
encoding the GroE proteins were used as negative or positive controls, respectively. 

Effect on Lorist6 replication of over-expressing of MC,. The effect of over-expressing 
Gp31 proteins from pJC vector series on the replication of the bacteriophage A. origin 
vector, Loriste (Gibson, T. J., Rosenthal, A. & Waterston, R. H. (1987) Gene 53, 283- 
286) in TGI (Gibson, T. J. (1984) Ph.D. thesis. University of Cambridge, U.K) or SVl 
(Georgopoulos, C., Hendrix, R. W., Casjens, S. R. & Kaiser, A. D. (1973) J. Mol. Biol. 
76, 45-60) cells was determined essentially as described (Chatellier, J., Hill, F., Lund, P. 
& Fersht, A. R. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 9861-9866). 

2. Example 1: Gp31 protein as a scaffold for displaying heptameric GroEL 
minichaperone. We describe a scaffold on which any polypeptide may be hung; as a 
result, the polypeptide is oligomerised. The scaffold is the bacteriophage T4 Gp31 (gene 
product) heptamer. The monomeric protein is 1 2 kDa, but it spontaneously forms a stable 
heptameric structure (90 kDa) of which the three-dimensional structure is known from X- 
ray crystallography (Hunt, J. F., van der Vies, S., Henry, L. & Deisenhofer, J. (1997) Cell 
90, 361-371). This illustrates that a highly mobile polypeptide loop (residues 25 to 43; 
Chatellier, J., Mazza, A., Brousseau, R. & Vemet, T. (1995) Analyt. Biochem. 229, 282- 
290) projects fi-om each subunit (Figure 1). The basis of the method is the substitution of 
this loop by a chosen peptide sequence. 

In an effort to increase the avidity of minichaperones for substrates, and consequently to 
improve their chaperonin-facilitated protein folding, we generated the fusion protein, 
Gp31Aloop::GroEL(191-376) (hereafter named MC7), where the mobile loop of Gp31 
was replaced by the sequence of minichaperone GroEL (residues 191 to 376) (Figure 2). 
MC7 was cloned downstream of the T7 promoter of pRSETAsht-£^2^ I vector (Chatellier, 
J., Hill, F., Lund, P. & Fersht, A. R. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 9861-9866). 
After sonication, the soluble and insoluble fractions of IPTG-induced transfected 
C41(DE3) cells (Miroux, B. & Walker, J. E. (1996) J. Mol. Biol. 260, 289-298) were 
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analysed by SDS-PAGE. Most of MC, was present in the insoluble fraction. Insoluble 
material dissolved in 8 M urea was efficiently refolded by dialysis at 4 °C. MC7 was 
purified by ion-exchange and gel filtration chromatography. MC7 was over-expressed in 
C41(DE3) cells to give 0.25-0.5 g purified protein per L of culture. Purified MC, 
coincided to seven 30.6 kDa subunits of Gp31Aloop::GroEL(191-376) as determined by 
analytical size exclusion chromatography (Figure 3 a) and analytical ultracentrifugation 
(Figure 3 b); Gp31Aloop corresponds to a tetra-decamer (14 subunits). The introduction 
of a foreign polypeptide in the Gp31 scaffold does do prevent its oligomerisation ability. 
The electronic microscopy studies of MC7 revealed views that correspond to front views 
of oligomers with a diameter close to the one of GroEL (J.L. Carrascosa, J.C. & A.R.F., 
unpublished). The circular dichroism spectrum of MC7 indicated significant a-helical 
structure (Figure 4a). The thermal unfolding monitored by far UV-CD was reversible 
although more than one transition exist (Figure 4b). 

Bacterial GroES or the human mitchodrial HsplO homolgous oligomeri sable scaffolds 
have been also successfiilly used to oligomerise polypetides displayed in their mobile 
loops. 

3. Example 2: Binding to heptameric bacterial co-chaperonin, GroES. 

The functionality of MC7 was examined for binding to GroES, since the interaction 
between GroEL and GroES is known to be less favourable for one monomer than for the 
heptamer. MC7 bound specifically to GroES, conversely monomeric minichaperone 
GroEL(191-376) did not detectably bind the bacterial co-chaperonin (Figure 5a). 

The ability of a synthetic peptide corresponding to residues 16 to 32 of GroES mobile 
loop to displace bound GroES from MC7 was tested by competition ELISA. The 
synthetic GroES mobile loop peptide did inhibit the binding of MC7 with an IC50 of 10 
compared to 100 |aM for GroEL (Figure 5b). The apparent dissociation constant for 
the formation of the GroEL-GroES complex is low (10-^ M), which is compatible with 
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cycling of GroES on and off GroEL during chaperonin-assisted folding. On the other 
hand, GroELsRi (Weissman, J. S., Rye, H. S., Fenton, W. A., Beechem, J. M. & 
Horwich, A. L. (1996) Cell 84, 481-490) is unable to release GroES in the absence of 
signal transmitted via the binding of ATP to an adjacent ring ring. The 10-fold decrease 
of the affinity of MC7 for GroES may be sufficient for multiple binding and release 
cycles. 

Binding to antibodies. A major application of oligomerisable scaffolds is the conversion 
of the hung polypeptides to antigens. The oligomerisation improves both detection of 
antibodies against, and the induction of antibodies to, such antigens. Indeed, the avidity 
effect of the heptameric struture of MC7 was confirmed by analysing the binding of 
antibodies specific to GroEL (Figure 6); comparable detection levels were observed for 
GroEL and MC7 at the different concentrations of antibodies used. In addition, using 
affinity panning of immobilised MC7 for a large library of bacteriophages ("phage 
display") that display single-chain Fv (scFv) antibodies fragments, we selected 
recombinant monoclonal scFvs that recognized only and specifically GroEL(l 91-376), 
and not the scaffold, Gp31 Aloop (P. Wang, J.C., G. Winter & A.R.F., unpubHshed) 

In vitro activity of MC7. In vitro, heat- and dithiothreitol-denatured mitochondrial malate 
dehydrogenase (mtMDH) refolds in high yield only in the presence of GroEL, ATP, and 
the co-chaperonin GroES (Peres Ben-Zvi, A. P., Chatellier, J., Fersht, A. R. & 
Goloubinoff, P. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 15275-15280). Monomeric 
minichaperone GroEL(191-376) binds denatured mtMDH, proteecting is aggregation 
(Figure 7a) but, it is ineffective in enhancing the refolding rate (Figure 7b). Conversely, 
MC7, which protect further denatured mtMDH from aggregation (Figure 7a) is active in 
refolding denatured mtMDH (Figure. 7a) with a rate of 0.02 nM.min"', compared to 0.04 
nM.min"' for wild-type GroEL alone (Figure 7b). After 120 min, the yield of refolded 
mtMDH by MC7 is about 2.5-3 nM, compared to 6 nM of enzyme rescued by wild-type 
GroEL (Figure 7c). Although saturating concentration of GroES (4 \xM) does increase 
about 3- to 5-fold the rates at the beginning of the refolding reaction, a 10-fold decrease 
of the final yield was observed; indicating the absence of multiple cycles of binding and 
release of GroES to MC7 (data not shown). Nevertheless, MC? is more efficient than 



36 

GroELsRi mutant (Llorca, O., Perez-Perez, J., Carrascosa, J., Galan, A., Muga, A. & 
Valpuesta, J. (1997) J. Biol. Chem. 272, 32925-32932; Nielsen, K. L. & Cowan, N. J. 
(1998) Molecular Cell 2, 1-7; this study); remarkably, MC, is only 2-fold less active than 
wild-type GroEL in refolding a non-permissive substrate in vitro. 

4. Example 3: In vivo complementation of thermosensitive groEL mutant alleles 
at 43 °C. 

We sought complementation of two thermosensitive (ts) groEL mutants of E. coli at 43 
°C. E. coli SV2 has the mutation Glul91-*Gly in GroEL corresponding to groEL44 
allele, while SV6 carries the EL673 allele, which has two mutations, Glyl73^Asp and 
Gly337-»Asp. Complementation experiments were performed by transforming the 
thermosensitive (ts) E. coli strains SV2 or SV6 with the pJC series of expression vectors 
vector (Chatellier, J., Hill, F., Lund, P. & Fersht, A. R. (1998) Proc. Natl. Acad. Sci. 
U.S.A. 95, 9861-9866) and plating an aliquot of the transformation reaction directly at 43 
°C. Subsequently, plasmids pJC from a representative number of individual clones 
growing at 43 °C were lost in the absence of continued chloramphenicol selection. Nearly 
all (> 95%) the cured clones were thermosensitive at 43 °C indicating the absence of 
recombination events for the reconstitution of wild-type groEL gene. The results obtained 
are qualitatively similar to those previously described (Chatellier, J., Hill, F., Lund, P. & 
Fersht, A. R. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 9861-9866). Only minichaperone 
sht-GroEL(193-335) complements the defect in SV2. The defective groEL in SV6 was 
complemented by expression of minichaperone sht-GroEL(19 1-345), and less well by 
sht-GroEL(l 93-335). Conversely, MC7 and GroELsRi complement both temperature- 
sensitive E. coligroEL44 aiadgroEL673 alleles at 43 °C (Table 1). Colony-forming units 
were not observed for either strain at 43 °C with vectors either lacking inserts (pJCsht) or 
lacking GroEL(l 91-376) (pJCGp31Aloop). 

It has been suggested the higher stability of shortest minichaperone sht-GroEL(l 93-335) 
could be responsible for the complementation of groEL44 mutant allele. To test this 
eventuality, we purified GroEL(El 91 G; groEL44 allele) mutant and compared its thermal 
stability with the wild-type GroEL. We found no difference in stability between the 
mutant and the wild-type proteins in presence or absence of ATP. In addition, highly 
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stable functional mutants of GroEL (193-345) do not complement, as the parental 
minichaperone (Table 1), the defects in SV2 or even SV6. We concluded the thermal 
stability of minichaperone is not accountable for the complementation oigroEL defects. 

5 Table 1. Relative colony forming ability of transformed ts groEL44 or groEL673 E. coli 
strains at 43 °C. 



GroEL strains 



Plasmids pJC 


SV2 groEL44 


SV6 groEL673 


short his tag (sht) {ES-, EL') 


< 10-4 


< 10-4 


GroES(l-97) 


5 X 10-3 


< 10-4 


Gp31(l-lll) 


0.5 X 10-3 


< 10-4 


Gp31 Aloop 


< 10-4 


< 10-4 


GroEL(l-548) 


1 


1 


GroES-EL 


1 


1 


sht-GroEL(191-376) 


< 10-4 


< 10-4 


sht-GroEL(191-345) 


0.01-0.02 


0.07-0.09 


sht-GroEL(193-335) 


0.05-0.09 


0.03-0.05 


Gp31A;:GroEL(191-376) 


0.15-0.2 


0.1 



10 5. Example 4: In vivo complementation at 37 °C. 

The effects of MC7 on the growth at 37 °C of a strain of £. coli in which the chromosomal 
groEL gene had been deleted were analysed in two ways. First, we attempted to delete the 
gi'oEL gene of TGI which had been transformed with the different pJC MC7 vector by PI 
15 transduction. However, no transductants could be obtained where the groEL gene had 
been deleted, unless intact GroEL was expressed from the complementing plasmid. This 
is consistent with the known essential role of GroEL. Second, we analysed the 
complementation of AI90 {AgroEL::kan^) [pBAD-EL] E, coli strain. In this strain, the 
chromosomal groEL gene has been deleted and GroEL is expressed exclusively from a 
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plasmid-bome copy of the gene which can be tightly regulated by the arabinose '?bad 
promoter and its regulatory gene, araC. AraC protein acts as either a repressor or an 
activator depending on the carbon source used, ^bad is activated by arabinose but 
repressed by glucose (Guzman, L.-M., Belin, D., Carson, M. J. & Beckwith, J. (1995) J. 
5 Bacteriol. 177. 4121-4130). The AI90 [pBAD-EL] cells can not grow on medium 
supplemented with glucose at 37 °C (Ivic, A., Olden, D., Wallington, E. J. & Lund, P. A. 
(1997) Gene 194, 1-8). As minichaperones (Chatellier, J., Hill, F., Lund, P. & Fersht, A. 
R. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 9861-9866.), MC7 was unable to suppress this 
groEL growth defect (Table 2). We then determined whether MC7 could supplement low 

10 levels of GroEL fi-om transfected AI90 [pBAD-EL]. At 0.01% arabinose, cells transfected 
with pJC expressing sht alone, GpSlAloop or sht-GroEL(191-376), showed little colony 
forming ability (lees than 5%). But those containing pJC MC7 produced about 30% of the 
number produced in the presence of 0.2% arabinose. Thus, pJC MC7, but not 
pJCGroBLsRi, significantly supplements depleted levels of GroEL., about twice as pJC 

15 sht-GroEL(l 93-335) (Chatellier, J., Hill, F., Lund, P. & Fersht, A. R. (1998) Proc. Natl. 
Acad. Sci. U.S.A. 95, 9861-9866). 

Table 2. Plating ability of transformed AI90 {LgroELwkan^) [pBADEL] E. coli strain at 37 °C 
in presence of different amount of arabinose. 

% L(+)arabinose 



PlasmidspJC 0.15 0.10 0.01 0.00 

short his tag (sht) 
{ES-, EL-) 

Gp31Aloop 
GroEL (1-548) 
sht-GroEL (191-376) 



++ + +/- 



++ + -I-/- 



++ + +/. 
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Gp31A:: +++ +++ + 

GroEL (191-376) 

growth identical to that in presence of 0.2 % L(+)arabinose (100 %), in terms of 
both number and size; ++, about 50 % of the colonies relative to that in presence of 0.2 
% L(+)arabinose; +, about 30 % of the colonies; +A, <5% of the colonies and size 
reduced relative to that in presence of 0.2 % L(+)arabinose; no visible colonies. 

6, Example 5: Effect on bacteriophages X and T4 growth of over-expressing MC7. 

Bacteriophages X and T4 require the chaperonins GroES and GroEL for protein folding 
during morphogenesis (Zeilstra-Ryalls, J., Fayet, O. & Georgopoulos, C. (1991) Annu. 
Rev. Microbiol. 45, 301-325). Nine^ra^: alleles which fail to support X growth have been 
sequenced (Zeilstra-Ryalls, J., Fayet, O., Baird, L. & Georgopoulos, C. (1993) J. 
Bacteriol. 175, 1134-1143). We examined the ability of MC7, over-expressed from the 
constitutive tet promoter on a high-copy number vector (see Figure 2), to complement 
three mutant groEL alleles for plaque formation by X (b2cl) at 30 °C (Table 3) and T4 at 
37 °C (Table 4). 

The groE operon was named for its effects on the E protein of X (Georgopoulos, C, 
Hendrix, R. W., Casjens, S. R. & Kaiser, A. D. (1973) J. Mol. Biol. 76, 45-60). Although 
heat induction of the groE operon has been shown to decrease burst size of 
X bacteriophage in E. coli (Wegrzyn, A., Wegrzyn, G. & Taylor, K. (1996) Virology 217, 
594-597). In contrast, we showed that the over-expression of GroEL alone, which 
resulted in slower growth of the bacteria, suffices to inhibit X growth (Table 3). This 
effect was specific; over-expression of GroEL together with GroES caused only a four- 
fold drop in plaques. Over-expression of GroES alone had no effect. Minichaperone 
GroEL(191-376) had no effect on plaque counts in SVl (groE'^). Conversely, over- 
expression of MC7 prevents plaque formation by bacteriophage X'm SVl, but less 
markedly than GroEL (Table 3). It seems that the main effect of GroEL over-expression 
is mediated through the X origin, which requires two proteins, O and P. As with GroEL, 
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MC-j (or GroELsRi) inhibit the replication of the Lorist6 plasmid which use the 
bacteriophage X origin. The effect on Lorist6 shows that the unfoldase activity is also an 
essential part of GroEL activity in vivo. MC7 and minichaperones possess both, un- and 
folding, activities. GroEL over-expression gives weak complementation of X growth in 
5 SV2 {groEL44) and SV3 {groEL59\ Ser201^Phe). MC7 does not, but GroELsRi does 
complement any of the E, coli groEL mutant strains for bacteriophage X growth at 30 °C 
(Table 3). 

Bacteriophage T4 (T4D0) also requires a functional groEL gene, but encodes a protein 
10 Gp31 which can substitute for GroES. The requirement for GroEL can be distinguished 
genetically from A-'s requirement. Thus only two of the four groEL alleles fail to support 
T4 replication; these are also the two thermosensitive mutations EL44 and EL673 
(Zeilstra-Ryalls, J., Fayet, O., Baird, L. & Georgopoulos, C. (1993) J. BacterioL 175, 
1134-1143; Zeilstra-Ryalls, J., Fayet, O. & Georgopoulos, C, (1991) Annu. Rev. 
15 Microbiol. 45, 301-325). While over-expression of Gp31 allows T4 grov^h in all strains 
(only SV2 and SV6 strains normally do not allow T4 growth), over-expression of 
GpSlAloop inhibits T4 replication. On the other hand, MC7 does, as does GroELsRi, 
complement E. coli groEL mutant strains for bacteriophage T4 growth at 30 °C (Table 3). 

20 An unsual basis for thermo-sensitivity in a groEL mutation. Surprinsingly, over- 
expression of GroES demonstrates allele-specific complementation for X and T4 of 
GroEL44 (Glul91^Gly) mutant (Tables 3 & 4). The effect is nevertheless incomplete; 
plaques on SV2 [pJC GroES] are invariably smaller than on SVl, or SVl [pJCGroES], 
The E191G single mutation blocks the assembly of the head structure of bacteriophage X . 

25 A possible molecular basis for this allele-specificity hes in the nature of the groEL44 
mutation. The substitution of Glul91-^Gly in the hinge region between the intermediate 
and apical domains of GroEL presumably increases the flexibility of the hinge, and 
thereby, modulates a hinged conformational change in GroEL required for proper 
interaction with GroES. Indeed, the pivotment of the hinge region ensures proper 

30 interaction with GroES. For example, the mutant GroEL59 (Ser201->Phe in the same 
hinge region) in SV3 has low affinity for GroES. Over-expression of GroES will favour 
the formation of GroES-EL44 complex; we indeed also observed complementation of 
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SV2 for thermosensitivity and bacteriophages growth by over-expressing GroEL44 
mutant. Taking advantage of the GroES effect, we observed that GroEL minichaperones 
and MC7 all reduce both plaque size and number but, like GroEL, do not completely 
eliminate them in SV2 [pBADGroES]. 

GroEL44, purified to homogeneity, is effective in refolding heat- and DTT-denatured 
mitochondrial malate dehydrogenase in presence of ATP and saturating concentration of 
GroES. Surprinsingly, GroEL44 is as thermo-stable as the wild-type GroEL, indicating 
the mutation does not destabilize the overall conformation of the mutant. As anticipated 
from our in vivo genetic analysis, the affinity between GroEL44 and GroES is decreased 
at 37 °C and even more at higher temperature. 

Our results suggest that the groEL44 mutation changes the distribution of GroEL subunits 
between apical domain-open and closed conformations. To allow GroELsRi to release 
GroES in the absence of signal transmitted via the binding of ATP to an adjacent ring, we 
introduced the Glul91-^Gly mutation in GroELsRi, generating the GroELsR2 mutant. 
GroELsR2 is more efficient than MC7 and even more than GroELsRi in vitro and in vivo. 
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Table 3. Growth of bacteriophage X at 30 °C in transformed wild-type and groEL 
mutant strains. 







groEL strains 




Plasmids pJC 


SVl 


SV2 SV3 


SV6 






groEL44 GroEL59 


groEL673 


short his tag (sht) 


-H-+ 


- 


- 


GroES (1-97) 


-f-H- 


+++ 




Gp31 (1-111) 


-l-H- 




- 


GrpSlAloop 








GtoEL (1-548) 




+ ++ 


+/- 


sht-GroEL (191-376) 


-HH- 






Gp31A::GroEL(191- 
376) 


-1- 







J 
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Table 4. Growth of bacteriophage T4 at 37 °C in transformed wild-type and groEL 
mutant strains. 



Plasmids pJC 

short his tag (sht) 
EL') 

GroES (1-97) 



groEL strains 



SVl SV2 SV3 SV6 

(groEL'^) g^oEL44 groEL59 groEL673 



4-4- 



+4-4- 



4-4- 



Gp31 (1-111) 



Gp31Aloop 



GroEL (1-548) 



sht-GroEL (191-376) 



4-/. 



4~h+ 



4-4-+ 



+++ 



+++ 



++ 



Gp31A::GroEL(191. ++ + + +/- 

376) 

, normal plaque- forming ability relative to wild-type groELT strain, in terms of both 



number and size; ++, 5-fold fewer plaques relative to wild-type groEI^ strain, or both; +, 
10-fold fewer plaques, or plaque size reduced relative to wild-type groEI^ strain, or both; 
+/-, 10^-fold fewer plaques and plaque size reduced relative to wild-type groEl^ strain; -, 
no visible plaques (<10"'^). 
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Claims 

L A polypeptide monomer capable of oligomerisation, said monomer comprising an 
heterologous amino acid sequence inserted into the sequence of a subunit of an 
oligomerisable protein scaffold. 

2. A pol>peptide monomer according to claim 1, wherein the oligomerisable protein 
scaffold subunit it selected from the group consisting of bacteriophage T4 Gp31, 
Escherichia coli GroES and homologues thereof of the cpnlO family. 

3. A polypeptide monomer according to claim 1 or claim 2, wherein the heterologous 
amino acid sequence is inserted into the sequence of the oligomerisable protein 
scaffold subunit such that both the N and C termini of the polypeptide monomer 
are formed by the sequence of the oligomerisable protein scaffold subunit. 

4. A polypeptide monomer according to any preceding claim, wherein the 
heterologous amino acid sequence is inserted into the oligomerisable protein 
scaffold subunit by replacing one or more amino acids thereof. 

5. A polypeptide monomer according to claim 4, wherein the oligomerisable protein 
scaffold subunit is bacteriophage T4 Gp31 and the heterologous amino acid 
sequence is inserted into the oUgomerisable protein scaffold subimit by 
substantially replacing the mobile loop between amino acid positions 27 and 42. 

6. A polypeptide monomer according to claim 4, wherein the oligomerisable protein 
scaffold subunit is Escherichia coli GroES and the heterologous amino acid 
sequence is inserted into the oligomerisable protein scaffold subunit by 
substantially replacing the mobile loop between amino acid positions 19 and 29. 

7. A polypeptide monomer according to claim 4, wherein the oligomerisable protein 
scaffold subunit is bacteriophage T4 Gp31 and the heterologous amino acid 
sequence is inserted between positions 59 and 61 of the oligomerisable protein 
scaffold subunit. 
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A polypeptide monomer according to claim 4, wherein the oligomerisable protein 
scaffold subunit is Escherichia coli GroES and the heterologous amino acid 
sequence is inserted between positions 56 and 57 of the oligomerisable protein 
scaffold subunit. 

A polypeptide monomer according to claim 4, wherein the oligomerisable protem 
scaffold subunit is bacteriophage T4 Gp31 and heterologous amino acid sequences 
are inserted at both positions described in claims 5 and 7. 

A polypeptide monomer according to claim 4, wherein the oligomerisable protein 
scaffold subunit is Escherichia coli GroES and heterologous amino acid sequences 
are inserted at both positions described in claims 6 and 8. 

A polypeptide monomer according to claim 2, wherein the heterologous amino 
acid sequence is displayed at the N or C terminus of the oligomerisable protein 
scaffold subunit. 

A polypeptide oligomer comprising two or more polypeptide monomers according 
to any preceding claim, 

A polypeptide oligomer according to claim 12, which is a homooUgomer. 
A polypeptide oligomer according to claim 12, which is a heterooligomer, 

A polypeptide oligomer according to claim 14, wherein complementary biological 

activities are juxtaposed through the olipornerisatio-n of different polypeptide 
monomers. 

A polypeptide oligomer according to any one of claims 12 to 15, wherem the 
monomers are covalently crosslinked. 
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A polypeptide oligomer according to any one of claims 12 to 16, wherein the 
protein scaffold is in the form of a ring. 

A polypeptide oligomer according to claim 17, wherein the ring is a heptameric 
ring. 

A polypeptide monomer or oligomer according to any preceding claim, wherein 
the heterologous amino acid sequence is an immunogen. 

A polypeptide oligomer or monomer according to claim 19, wherein the protein 
scaffold is bacterial in origin. 

A polypeptide according to claiml9 or claim 20, for use in the detection or 
neutralisation of antibodies in vivo. 

Use of a polypeptide according to claim 19 or claim 20 for the detection or 
neutralisation of antibodies in vitro. 

A polypeptide according to claim 19 or claim 20, for use as a vaccine. 

A polypeptide monomer or oligomer according to any one of claims 1 to 18, 
wherein the heterologous amino acid sequence is an antibody or an antigen- 
binding fragment thereof 

A polypeptide according to claim 24, wherein the antibody fragment is a Vh 
domain. 

A polypeptide according to claim 25, wherein the antibody fragment is a Vh 
CDR3. 

A polypeptide according to any one of claims 24 to 26, for use in the detection or 
neutralisation of antigens in vivo. 
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28. Use of a polypeptide according to any one of claims 24 to 26 for the detection or 
neutralisation of antigens in vitro, 

29. A polypeptide monomer or oligomer according to any one of claims 1 to 18, 
wherein the heterologous amino acid sequence is a ligand for a receptor. 

30. A polypeptide monomer or oligomer according to any one of claims 1 to 18, 
wherein the heterologous amino acid sequence is a substrate for a kinase or 
phosphatase. 

31. A polypeptide according to claim 29 or claim 30, wherein the heterologous amino 
acid sequence comprises an at least partly randomised portion. 



32. A polypeptide monomer or oligomer according to any one of claims 1 to 18, 
15 wherein the heterologous amino acid sequence is capable of mediating a biological 

activity. 

33. A polypeptide according to claim 33, wherein the heterologous amino acid 
sequence is selected from the group consisting of an enzyme, an antibiotic, an 

20 enzyme inhibitor, a molecule involved in cell signalling, a hormone, an antigen, an 

immunogen, a nuclear localisation sequence, a cellular uptake sequence, a DNA 
binding sequence, a receptor and a ligand for a receptor. 

34. A polypeptide oligomer according to claim 32or 33, which comprises two or more 
25 different heterologous amino acid sequences having different biological activities. 

35. A nucleic acid sequence encoding a polypeptide according to any preceding claim. 

36. A method for preparing a polypeptide according to any one of claims 1 to 35, 
30 comprising the steps of inserting a nucleic acid sequence encoding a heterologous 

polypeptide into a nucleic acid sequence encoding a subunit of an oligomerisable 
protein scaffold, incorporating the resulting nucleic acid into an expression vector, 
and expressing the nucleic acid to produce the polypeptide monomers. 
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Abstract 

The invention relates to polypeptide monomer capable of oligomerisation, said 
monomer comprising an heterologous amino acid sequence inserted into the 
sequence of a subxmit of an oligomerisable protein scaffold. 
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